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Abstract 

Possible distributions are discussed for intertrade durations and first-passage pro- 
cesses in financial markets. The view-point of renewal theory is assumed. In order to 
represent market data with relatively long durations, two types of distributions are 
used, namely, a distribution derived from the so-called Mittag-Leffler survival func- 
tion and the Weibull distribution. For Mittag-Leffler type distribution, the average 
waiting time (residual life time) is strongly dependent on the choice of a cut-off 
parameter i max , whereas the results based on the Weibull distribution do not de- 
pend on such a cut-off. Therefore, a Weibull distribution is more convenient than a 
Mittag-Leffler type one if one wishes to evaluate relevant statistics such as average 
waiting time in financial markets with long durations. On the other side, we find 
that the Gini index is rather independent of the cut-off parameter. Based on the 
above considerations, we propose a good candidate for describing the distribution of 
first-passage time in a market: The Weibull distribution with a power-law tail. This 
distribution compensates the gap between theoretical and empirical results much 
more efficiently than a simple Weibull distribution. We also give a useful formula 
to determine an optimal crossover point minimizing the difference between the em- 
pirical average waiting time and the one predicted from renewal theory. Moreover, 
we discuss the limitation of our distributions by applying our distribution to the 
analysis of the BTP future and calculating the average waiting time. We find that 
our distribution is applicable as long as durations follow a Weibull-law for short 
times and do not have too heavy a tail. 
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1 Introduction 



The distribution of time intervals between price changes gives us important 
pieces of information about the market pQ. In particular, the fact that inter- 
trade durations are not exponentially distributed rules out the possibility of us- 
ing pure-jump Levy stochastic processes (i.e. compound Poisson processes) as 
models for tick-by-tick data. Levy processes have stationary and independent 
increments and are Markovian and all these properties are a consequence of 
exponentially distributed waiting times [1] . Other models have been proposed 
such as non-homogeneous compound Poisson processes, GARCH-ACD mod- 
els, continuous-time random walks and semi-Markov processes [2.3.4,5,6.7]. 

Recently, various on-line trading services on the internet were established by 
several major banks. For instance, the Sony Bank uses a trading system in 
which foreign currency exchange rates change according to a first-passage pro- 
cess. Namely, the Sony Bank USD / JPY exchange rate is updated only when 
a reference market rate fluctuates by more than or equal to 0.1 yen [_]. As a 
result, in the case of the Sony Bank rate, the average duration between price 
changes becomes longer, passing from 7 seconds to 20 minutes. Automatic 
FOREX trading systems such as the one offered by the Sony Bank are very 
popular in Japan where many investors use a scheme called carry trade by 
borrowing money in a currency with low interest rate and lending it in a cur- 
rency offering higher interest rates. As Japanese bond yields are low and US 
bonds offer higher interest rates and are rated as safe financial instruments, 
there is much trade in the USD/JPY market. 

In this paper, we wish to compare the time structure of the Sony bank trades 
with other markets such as BTP futures (BTP is the middle and long term 
Italian Government bonds with fixed interest rates) once traded at LIFFE 
(LIFFE stands for London International Financial Futures and Options Ex- 
change) . 

From the view-point of complex system engineering, a relevant quantity used 
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to specify the stochastic process of the market rate is the average waiting time 
(a.k.a. residual life time) rather than the average duration. In a series of recent 
studies by the present authors, the average waiting time of the Sony Bank 
USD/JPY exchange rate was evaluated under the assumption that the first- 
passage time (FPT) is a renewal process whose distribution obeys a Weibull- 
law. We found that, counter-intuitively, the average waiting time of Sony Bank 
USD/JPY exchange rate is more than twice of the average duration This 
fact is known as inspection paradox. It means in general that the average of 
durations is shorter than the average waiting time. This fact is quite counter- 
intuitive because the customer checks the rate at the time between arbitrary 
consecutive rate changes. We shall explain the interpretation of this fact for 
the case in which durations follow the Weibull distribution. 

The Weibull distribution is often used for modelling intertrade durations in 
financial markets PTTU] . On the other side, the so-called Mittag-Leffler survival 
function has been also proposed to represent the distribution of durations in 
several markets. For example, Mainardi et al. [11] showed that BTP future 
inter-trade durations are well- described by a survival function of Mittag-Leffler 
type. However, up to now, the Mittag-Leffler survival function has never been 
applied to evaluation of the average waiting time as it has infinite moments 
of any integer order. 

In this paper, we compare a Weibull distribution with a Mittag-Leffler type 
survival function in order to evaluate the average waiting time. We give an 
analytical formula for the average waiting time under the assumption that the 
FPT distribution might be described by a Mittag-Leffler survival function. We 
find that the average waiting time diverges linearly with respect to a cut-off 
parameter t max - This fact tells us that it is hard to handle the Mittag-Leffler 
survival function to evaluate relevant statistics such as the average waiting 
time. We next evaluate the Gini index as another relevant statistic to check 
the usefulness of the Mittag-Leffler survival function. 

We also provide a good candidate for the description of the first-passage pro- 
cess of the market rates, namely, a Weibull distribution in which the be- 
havior of the distribution changes from a Weibull-law to a power-law at some 
crossover point t x . We find that the average waiting time becomes much closer 
to the empirical value for the Sony Bank USD/JPY exchange rate than for 
a pure Weibull distribution. We also give a useful formula to determine the 
optimal crossover point in the sense that the gap of the average waiting time 
between the empirical and the proposed distributions is minimized for the 
crossover point. Moreover, we discuss the limitation of our distribution by ap- 
plying our distribution to the analysis of the BTP future and calculating the 
average waiting time. We find that our distribution is applicable as long as 
duration follows a Weibull-law in short duration regime and does not have too 
heavy a tail. 
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As mentioned above, in this paper, two sets of data are used. The first set 
comes from the Sony bank and the random variable analysed is a first-passage 
time, whereas the second set is made up of future BTP prices traded at LIFFE 
in 1997 for two different maturities: June and September. For these data, the 
relevant random variable is an intertrade duration. Both data sets have already 
been studied and extensively described in previous papers ( |9llllfl2|ll3|ll4|ll5| ). 
In both cases, we assume that the empirical random variables are a realization 
of a renewal process. A renewal process is a one-dimensional point process 
where at times T , T 1; . . . , T n , . . . some event takes place, and the differences 
Tj = Tj — Tj_i are independent and identically distributed (i.i.d.) random 
variables, so that T n = I^lLi 7 "*- Therefore T n can be seen as a sum of non- 
negative i.i.d. random variables, that is as an instance of random walk. For 
the Sony bank data the incoming events are price changes due to crossing the 
±0.1 yen level around the current price, whereas in the BTP-future case, the 
events are consecutive trades. Therefore, in the Sony bank case, the waiting 
time is the residual life-time to next passage and in the BTP-future case, the 
waiting time is the residual life-time to the next trade. 

This paper is organized as follows. In the next section, we introduce both the 
Mittag-Leffler survival function and the Weibull distribution. Then, we dis- 
cuss their properties in detail. In section 3, we evaluate the average waiting 
time for the Mittag-Leffler survival function. We find that the average waiting 
time diverges linearly as a function of the cut-off parameter t max . In section 
4, we provide a theoretical formula of the Gini index for the Mittag-Leffler 
function and we check the usefulness by comparing the theoretical prediction 
with empirical data analysis for the BTP future. In section 5, we introduce 
a Weibull distribution with a power-law tail to compensate a small gap be- 
tween the results of theoretical and empirical data analysis for the average 
waiting time. In the same section, we give an intuitive explanation for the 
non-monotonic behavior of the average waiting time corrected by means of 
the Weibull distribution with a power-law tail. From the observation, we ob- 
tain a useful formula to determine the optimal crossover point for which the 
gap between theoretical prediction and the empirical data analysis for the av- 
erage waiting time is minimized. In section 6, we apply our distribution to the 
BTP future to check the limitation of our approach. In the final section 7, we 
summarize and discuss our results. 



2 Mittag-Leffler survival function and Weibull distribution 

For BTP-future data, the successive time intervals are reasonably described 
in terms of the Mittag-Leffler survival function [TT] : 



4 



0.1 



0.01 



0.001 



p = 0.96 
power-law 
streched exponential-law 




10 



100 



t 



1000 



10000 100000 



Fig. 1. The behavior of the Mittag-Leffler survival function (fT]). The parameters are 
P = 0.96 and t Q = 1200. 



where T(z) denotes the Gamma function; we set the upper bound of the sum 
to a large value n max for practical numerical calculations. The above Mittag- 
Leffler survival function has asymptotic forms: 

Ep(-(t/t f) ~ exp[-(t/t ) /3 /r(l + 0)\ (t/to -> 0) 

(stretched exponential) and 

E p (-(t/t f) ~ (t/to)- p /T(l - 0) (t/t - oo). 

We illustrate these asymptotic forms in Fig. [TJ Then, the density function of 
the duration t is given by 



Pml\P '■ to, P) = 



dE p {-{t/tof) = 1 » (t/tp) 
^ " t £^ J r(/?r 



r(/3n + /3) 



(2) 



In the limiting case P — 1, the Mittag-Leffler distribution coincides with the 
exponential distribution. On the other hand, the so-called Weibull distribution 
has a probability density function given by 



Pw(t : m, a) 



m exp ( 

a V a 



(3) 
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and is a good approximation to the passage times for the Sony Bank USD/JPY 
exchange rate in a non- asymptotic regime t <C oo. It can be directly verified 
that the Weibull distribution ([3]) becomes an exponential distribution for m = 
1 and a Rayleigh distribution for m = 2. 

For these two candidate distributions, we study a relevant statistic: the average 
waiting time, a quantity used in queueing theory, which has been defined in 
the introduction as the residual life-time for a renewal process. 



3 Divergence of the average waiting time for the Mittag-Leffier 
survival function 

The first two moments of the Mittag-Leffier distribution diverge. For the aver- 
age of a random variable t, we use the notation E(t). It can be shown that also 
the residual life-time, defined as the ratio of the first two moments of the distri- 
bution diverges. One possibility is truncating the Mittag-Leffier distribution at 
some time i max and normalizing to J max -Pml(^) dt. This distribution has finite 
moments of all orders and it turns out that the waiting time w = E(t 2 )/2E(t) 
is: 



In Figure [21 we plot the w for several values of t max with t 1 = 1/12. However, 
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Fig. 2. The average waiting time w by ([4]) with t, 
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we should keep in mind that the above w diverges as i max — > oo. As we saw, the 
asymptotic form of the above density function is ~ t~ 1 ~ 13 when t/t — > oo. The 
divergence of the w might come from only this power-law regime. Actually, 
we see this fact by evaluating the first two moments of the density function in 
the tail region. These two moments behave as E(t 2 ) ~ J max r~@~ 1 r 2 dr = t 2 ^ 
and E(t) ~ Jo ma * r~^~ l rdr = t]^ for (j3J) as t max — > oo. Thus, the average 
waiting time diverges linearly as a function of t max as w ~ CaxAmi = ^max- 
We can now define an effective probability density which approximates the 
Mittag-Leffler distribution as follows: 



p , r , Ps(t) (t<t x ) 

P M L{t)^{ (5) 

t p x +1 p s (t x )t-l-p (t>t x ) 

where Ps{t) is a stretched exponential distribution. With this approximation, 
one gets 



w(t x ,t max : (3) 



t 2 p s (t)dt + f x +1 p s (t x ) e-^dt 
/ f x t 2 p s (t)^t + t^Ps^y^ + 0(i) 

2 J t0 tP s (t)dt + 2^ +1 P 5 (t x ) tUt + 0(1) 
w s (t,t x :f3) + 0(l) (J3>2) 
feg (1 < /3 < 2) - (6) 

tmax (0 < /3 < 1) 

Notice that, for this approximant of the Mittag-Leffler function, it is mean- 
ingful to consider /3 > 1, as one can build a legitimate probability density (a 
non- negative function of positive reals normalized to 1) for any (3 > 0. How- 
ever, the Mittag-Leffler function is no longer a legitimate survival function for 
(3 > 1 as it assumes negative values. For (3 > 2 the approximant function has 
finite first and second moment and also the waiting time ws{t,t x : (3) has 
a finite value. Thus, if we were restricted to choose the parameter (3 within 
the range < (3 < 1, the average waiting time w would diverge as ~ £ ma x- If 
we could choose (3 > 2, we would obtain a finite value of the average waiting 
time, however, for f3 > 1, the approximate probability density has a maximum 
within the range t < oo. In Fig. [31 the behavior of the approximate density 
is shown for several values of the parameter (3 in the short time regime. From 
this figure, we find that the maximum appears for (3 > 1. This behavior is 
quite different from the empirical probability density function. Moreover, as 
mentioned above, for (3 > 1, the Mittag-Leffler distribution cannot be used. 
Thus, we should use a truncated Mittag-Leffler distribution and include a fi- 
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Fig. 3. The short-time-range behavior of the approximate probability density defined 
in eq. (|5|) for several values of (3. We set t x = 12. 

nite upper bound of the integral with respect to t, namely, the maximum value 
of the duration or the cut-off parameter t max . 

In the latter case, we have to face the following problem. Namely, how do 
we determine t max to obtain a reasonable w that is consistent with the result 
obtained from the empirical data analysis? Unfortunately, the estimate of 
w also depends on a second parameter, the crossover point t x at which the 
density function changes its shape from a stretched exponential-law to a power- 
law. If t x is close to t max , the value of w is not sensitive to the value of t max ; 
however, if t x is far from t max , w does depend on the value of t max because the 
integral of the power- law tail becomes dominant. These considerations lead 
to the conclusion that the Mittag-Leffler function is hard to use in order to 
evaluate the average waiting time for the market rates with a relatively long 
duration such as the Sony Bank USD/JPY exchange rate. 



4 The Gini index 

Another relevant statistic to specify the market rate with a long duration is 
the so-called Gini index, which denotes the inequality of the durations used 
this paper. In other words, fluctuation level in duration lengths can be simply 
described in terms of the Gini index. For a Weibull distribution, it was shown 
that the Gini coefficients given by both analytical prediction and empirical 
evidence coincide [15]. However, for the Mittag-Leffler survival function, it 
is not clear whether the analytical prediction of the Gini index is close to 
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the corresponding empirical evidence due to the tail-effect discussed in the 
previous section. Here, we study this issue. 



4-1 Analytical evaluation 



The Gini index G is defined as the area between the Lorentz curve defined 
below: (X(r),Y(r)) r G [0, 00] and the line Y = X corresponding to perfect 
equality, namely, 



r 7 dX 

G = J{X -Y)dX = j{X{r)-Y{r)}^--dr. (7) 



For the truncated Mittag-LefHer distribution, the Lorentz curve can be calcu- 
lated as 



XM= SSdtP ML (t:to,P) ~ (r/t o r 



Y(r) 



Jo max dtP ML {t : t , (3) ~ to 

SZdttP ML (t:t ,/3) 
J^dttP M L(t:t ,P) 

>M 



1 2^ n =0\ L ) T(3n+1) ^~ 1 ^r, 



'maxZ^„=0l / T(/3n+l) "I" max Z^n=0l ^ (/3n+l)r(/3n+l) 

y-oo / -i\ w (r/t f n 
' Z^„=0 'H -"-J (/3n+l)r(/3n+l) ^ 



E~ ^(-i) r 



QmaxAo)' 9 " 

(/3n+l)r(/3n+l) 



In Fig. HI we plot the Lorentz curve for the parameters f3 = 0.96 and to = 12 
(according to reference [H]), but with an effective upper bound of the integral 
set at r max = 100. In the same figure, we show the Lorentz curve for the the 
Poisson process, namely, for the exponential duration for which the curve can 
be written explicitly Y = X + (1 — X) log(l— X). From this figure, one can see 
that the area between the Lorentz curve for the Mittag-LefHer and the perfect 
equality line Y = X is larger than the area between the Lorentz curve for the 
Poisson process and Y — X. This means that the durations generated from 
the Mittag-LefHer survival function is more biased than that of the Poisson 
process. This fact can be justified by directly calculating Gini's index. For 
the Lorentz curve of the truncated Mittag-LefHer distribution, G is written as 
follows: 



G = 2 / 



( U \Bn -rV°° ( 1 \ n n ( r / t o) l3n 

{r/t f n r ^n=0l L ) 03n+l)r(/3™+l) 



n T(Bn + l) r V°° ( IV gfcWfoV! 
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Fig. 4. The Lorentz curve for the Mittag-Leffler survival function. We set to = 12 
and (3 = 0.96. We set the effective upper-bound of the integral r max — 100. We also 
plot the Lorentz curve for the Poisson process, namely, for the exponential duration 
for which the curve is written explicitly Y = X + (1 — X) log(l — X). Y = X is the 
perfect equality line. 

x lfv 1)n (r/t y^ 

to to 1 T((3n + (3) 

Or 00 (— ]) n (r /tn) 13 ^^' 1 

_ a 1 max v V J- J \i max/ ^0 J 

~^hho {n + l)T{(3n + (3) 

n oo oo (_-l\n+l( r If \P(n+l)+P~l 

-I iihix ^""^ 1 1 ' ' ' max/ ' 1 



^o^(n + / + i)r(^ + i)r(/3/ + /3) 

n(-l)"+'(r max /fo)^("+')+^- 1 

(10) 



2r max E n =o X^=0 ( / 3( n+ ;) +/ 3+i)( j a n+ i)r(/3n+l)r(/3/+/3) 



t n V°° ( 1>- w ( r ™a*Ao) gw 

u Z^„=ol (/3n+l)r(/3n+l) 

We plot the Gini index G as a function of (3 for to — 12 and r max = 100 in Fig. 
[5j From this figure, we find that the Gini index for (3 = 0.96 is G = 0.51 and G 
approaches 1/2 which is the Gini index for the exponential duration. For both 
the Lorentz curve and the Gini index, we set the effective upper-bound of the 
integral as r max = 100, however, we find that this statistic is free from the kind 
of divergence affecting the average waiting time w due to the upper-bound. 



4-2 Empirical data analysis 



Based on the method proposed in [T3], we obtain G = 0.59 for the BTP 
future with maturity June and G = 0.57 for the BTP future with maturity 
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Fig. 5. The Gini index as a function of (3 for the truncated Mittag-Lefner survival 
function. We set to = 12. The constant horizontal line G =1/2 corresponds to the 
Gini index for the exponential duration. We set the effective upper-bound of the 
integral r max = 100. 

September, whereas the theoretical prediction obtained in the previous section 
is 0.51. From these results, we find a manifest gap between the theory and 
empirical data analysis, however, this gap is relatively small in comparison 
with the gap for the average waiting time as we shall see later. 



5 A Weibull distribution with a power-law tail 

In previous studies, we found that a Weibull distribution is a good candidate 
to describe the Sony Bank USD/JPY exchange rate time statistic [13]. The 
average waiting time was also evaluated to investigate to what extent the 
Sony Bank rate is well-explained by the Weibull distribution [9lfH"] . We also 
found that the empirical result of the waiting time of Sony Bank USD/JPY 
exchange rate (~ 49.19 [min]) is more than twice of the average duration 
(~ 20.52 [min]). The situation is known as inspection paradox as discussed 
in the introduction. For the Weibull distribution, the paradox occurs when 
the Weibull parameter satisfies m < 1 as shown in Fig. [61 In this plot, we 
used the fact that E(t) = a 1/m (l/m)T(l/m), w = a 1/m T(2/m)/T(l/m) for a 
Weibull distribution fl3j) and the condition E(i) = w require l\ = (r(l/m)} 2 = 
mT(2/m) = l<i- The solution of this equation l\ = l 2 gives m = 1, and m < 1 
for li > l 2 means E(t) < w, vice versa [9]. This fact is intuitively understood 
as follows. When the parameter m is smaller than 1, the bias of the duration 
is larger than that of the exponential distribution. As a result, the chance 
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Fig. 6. li = r(l/m) 2 and I2 = mT(2/m) as a function of Weibull parameter m. At 
the intersection of both lines for m = 1, the average waiting time for the Weibull 
distribution is equal to the average duration. For m > 1, the average waiting time 
is longer than the average duration, whereas for m < 1, the so-called inspection 
paradox takes place |9j. 



for customers to check the rate within large intervals between consecutive 
price changes is more frequent than the chance they check the rate within 
short intervals. Then, the average waiting time could become longer than the 
average duration. 

The bias of the duration for the Weibull distribution with m < 1 is directly 
confirmed by means of the Gini index. It was shown that the analytical predic- 
tion of the Gini index calculated for a Weibull distribution is in good agreement 
with the value obtained from the empirical data of the Sony Bank rate |15j . 

However, there exists a significant small gap between the theoretical prediction 
(~ 44.62 [min]) and the empirical result for w (~ 49.19 [min]). 

In this section, we consider to what extent the average waiting time can be 
modified by taking into account a power-law behavior for the tail in the FPT 
distribution. In our previous paper, we assumed that the FPT of the Sony 
Bank rate might obey a pure Weibull distribution ([3]). However, several empir- 
ical data analysis have shown that the shape of the FPT distribution changes 
from a pure Weibull-law to a power-law at some crossover point t x . 

Therefore, here it is natural to assume that the FPT distribution should be 
modified as follows: 
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mtm " 1 cxp (-*—) (t <t ) 
P w (t:m,a n ,t x ) = { a v \ a)K XJ (11) 

At" 7 (*>**) 
Under the assumption of continuity at t x , the condition 



tx T A = f — ^ — J exp(-C/a) (12) 
is required. This condition determines the parameter A as 



,m+7-l , m 

A = mt^ — ftxy (13) 

a \ a J 

Thus, the modified FPT distribution is given by 



mtm ~ 1 cxp ( *) (t<t X/ 
P w (t :m,a,'y,t x ) = { mt t+i-i ,°,™ N ' ( 14 ) 



<__ exv t -y {t>t> 



From the FPT distribution, we have the average waiting time w from the 
renewal-reward theorem as follows. 



^— r ( — ) B ( — + 1, + -T^exp — M 



where B(a,x) denotes the following incomplete Gamma function: 



B(a,x) = — -r t^e-'dt. (16) 
1 (a) J 

v ' o 

The next problem is how to choose the parameters 7, t x , m and a. Fortunately, 
we know these parameters from empirical data analysis rr3"][T4] . Substituting 
those parameters 7 = 4.67, m = 0.585 and a = 49.63 into our formula ffl5|) . 
we evaluate the average waiting time w as a function of the crossover point 
t x . The result is plotted in Figure [71 In this figure, we present the average 
waiting time for three slightly different cases of m, namely, m = 0.58, 0.585 
and 0.59. From the empirical data analysis, we have t x ~ 18000 [s]. Therefore, 
we conclude that for m = 0.585, the waiting time is estimated as w = 45.66 
[min]. This value is much closer to the sampling value w = 49.19 [min] than the 
value obtained under the assumption of a pure Weibull distribution (w = 44.62 
[min]). Therefore, we conclude that a correction by taking into account the 
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Fig. 7. The average waiting time w as a function of t x . We set 7 = 4.67, a = 49.63. 
For two cases of the choice for m, namely, for m = 0.58, 0.585 and 0.59, the w is 
plotted. 

tail behavior of the Weibull distribution points into the right direction for 
estimating the average waiting time. 

The remaining gap Aw = 49.19 — 45.66 = 3.53 [min] might be due to a 
rough estimation of the crossover point t x . In the next section, we propose 
a systematic procedure to determine the appropriate crossover point t x so as 
to minimize the gap Aw by considering the non-monotonic behavior of the 
average waiting time w with respect to t x as shown in Fig. [7J 

5. 1 Intuitive explanation of the non-monotonic behavior 

From Fig. [7J we find a non-monotonic behavior in the curve of the average 
waiting time as a function of t x . The intuitive explanation is given as follows. 
In Fig. [HI we show the Log-Log plot of the survival function of the Weibull, 
the power-law and the empirical data for the regime t > 8000 [s]. In this 
figure, we set the crossover point t x = 10000 [s] to determine the normalization 
constant for the power-law distribution. Then, we find that there exists another 
intersection between the Weibull and the power-law distributions at t ~ 30000 
[s]. As the results, we obtain two distinct areas which are surrounded by the 
two lines, namely, the Weibull and the power-law distributions. Let us call 
these two areas as A\ (the left part) and Ai (the right part), respectively. It 
should be noted that the difference e between the empirical distribution and 
the Weibull distribution with a power-law tail is proportional to the difference 
of these two areas, that is, 



14 



eoc^li — Ai- 



(17) 



We should bear in mind that the above difference is dependent on the choice 
of the crossover point t x . 

In the following, we shall show that there exists an optimal crossover point 
t x = C at which the difference e is minimized. 

• 10000 < t x < C 

For this case, as shown in the upper left of Fig. [SJ, the curve of the power- 
law distribution goes up as the t x increases. As the result, the area A\ 
decreases, whereas the area Ai increases. Then, the difference e, namely, 
the gap between the empirical distribution and the Weibull distribution 
with a power-law tail decreases. 

• t x =C 

For this shown in the upper right of Fig. [SJ, the area Ai vanishes 

and the two distinct lines are degenerated to a single curve at tx — C . 
Then, the difference e is minimized and the averaged waiting time with true 
parameters obtained by empirical data analysis takes its maximum. 

• t x > C 

For this case, as shown in the lower Fig. [SJ the curve of the power-law 
distribution goes down further and the single intersection at t x = C moves 
to the right. As the result, the area Ai increases with the decreasing of 
the area Ai- Thus, from the definition of the difference ffT7]) . we find that 
for this case, the gap between the empirical distribution and the Weibull 
distribution with a power-law tail starts to increase again. 

From the above observation, we conclude that the predicted average waiting 
time takes its maximum at t x = C. 

Taking into account the above fact, we might determine the optimal crossover 
point t x for which the gap between the average waiting time for the empirical 
data and for the Weibull distribution with a power-law tail is minimized. In 
the next subsection, we shall discuss this issue. 

5.2 Determination of the optimal crossover point 

Let us consider the case m = 0.585 in Fig. [7| which was evaluated from the 
Sony Bank rate by using the Weibull paper analysis. The error due to the 
wrong estimation for the true FPT distribution Pr{t) can be divided into two 
parts, namely, the difference between the true (empirical) distribution and 
the Weibull distribution: Ew, and the difference between the true distribution 
and the power-law distribution: Ep ower . Thus, the total difference e is written 
in terms of the area between the true curve Pr(t), which is evaluated from 
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Fig. 8. The empirical distribution calculated from the Sony Bank rate and Weibull 
distribution with a power-law tail for t x = 10000 [s] (upper left), 24000 [s] (upper 
right) and 40000 [s]. 

empirical data analysis, and the Weibull distribution with a power-law tail. 
Then, we have the area 



e = J \P T (t) - P w (t)\dt + J \P T (t) - P Power (t)\dt 



= £w(t x ) + £ Power {t> 



which is proportional to the gap between the true value of the average waiting 
time and the same quantity estimated by the Weibull distribution with a 
power- law tail. In the limit of t x — > oo, the difference e leads to only the 
Weibull contribution ew(oo), whereas the difference e is identical to only the 
power-law contribution ep ower (0) for t x — ► 0. It is possible to show that there 
is a specific crossover point t x at which the difference e takes its minimum. 

Indeed, we first notice that the absolute values in equation (jl~8l) can be re- 
moved by taking into account the relationships between the magnitudes of 
the three distributions Pr(t), Pw(t) an d Ppowerif)- Then, it is possible to take 
the derivative of e with respect to t x as follows. 
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—P\y(t x ) + Ppower(tx) (Pt > P\v) A \Pt > Ppower) 




— 2Pr(t x ) + Pwitx) — Ppower(t x ) (Pt < Pw) A (Pt > Ppower) 
^ Pw(tx) — Ppower(tx) (Pt < P\v) A (Pt < Ppower) 



In order to show that the e takes its minimum at finite t x , we prove that there 
is a value of t x which satisfies de/dt x = 0, that is, 



for ((P T > P w ) A (P T > P PoW er)) V ((P T < P w ) A (P T < P PoW er)), and 



Actually, we defined the Weibull distribution with a power-law tail to satisfy 
Pw(t x ) = Ppower(tx) = Pr(t x ) m order to approximate the true empirical 
distribution. In the previous section, we obtained the Weibull distribution 
with a power-law tail <HM by taking into account the condition P w (t x ) = 
Ppower(t x ), namely, the continuity between two curves at the crossover point. 
As the value of the empirical distribution at t — t x , namely, Pr(t x ) is close 
to the theoretical prediction Pw(t x ) (or of course Pp OW er(tx)) for m = 0.585, 
the condition Pw(t x ) = Pp OW er(tx) — Pr(tx), namely, both (120]) and ff2~T]) are 
satisfied. 

Therefore, our statement holds true: there exists a crossover point t x at which 
the difference e takes its minimum. The non-monotonicity of the curve of the 
average waiting time is nothing but an effect of the fact that the difference e 
is minimized for the intermediate value of t x . 

This is another intuitive explanation for the non-monotonic behavior of the 
average waiting time as a function of t x . However, to determine the value t x , 
we need more information. Then, we use the fact discussed in the previous 
subsection, namely, the difference between the empirical distribution and the 
Weibull distribution with a power-law tail is proportional to the difference of 
the two distinct areas A± — A2- The difference e is written in terms of the 
distribution Pw(t) and Pp OW er(t) as follows. 




(20) 



2P T (t x ) = P w (t x ) + P P 



(21) 



for ((P T > P w ) A (P T < P PoW er)) V ((P T < Pw) A (P T > P/wr))- 



e = 




o 
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oo 



rn 



a 



I 
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t'^dt 



(22) 



o 



o 



where we used the explicit forms of the distributions Pwif) and Pp OW er(t) to 
obtain the second line of the above equation. Then, we take the derivative of 
e with respect to t x and set it to zero, that is de/dt x = 0, in order to obtain 
the necessary condition to let e take its maximum at t x . Then, we have 



This value £* might be a candidate to give an optimal crossover point for which 
the gap of the average waiting time w for the empirical distribution and for the 
Weibull distribution with a power-law tail is minimized. To compare the value 
for the true parameter set (m, a, 7) obtained from the empirical data analysis 
with that obtained in the previous subsection C ~ 24000 [s], we substitute 
the values m = 0.585, a = 49.63 and 7 = 4.67 into the above expression (1231) 
and immediately obtain 



This result is very close to the value C ~ 24000 in the previous section. 
Inserting the above crossover point t* with the other parameters (m, a, 7) 
estimated by empirical data analysis into the expression ( |T5l) . we obtain the 
average waiting time for the Sony Bank rate as w = 46.25 [min]. Then, the gap 
A is estimated as Aw = 49.19 — 46.25 = 2.94 [min]. Therefore, the correction 
obtained by modifying the crossover point reduces the gap Aw between the 
empirical and the theoretical predictions from Aw = 3.53 [min] to Aw = 2.94 
[min] 

Thus, we obtained a formula to determine the appropriate (and may be an 
optimal) crossover point t* for our proposed first-passage time distribution, 
that is, the Weibull distribution with a power-law tail. It is important to stress 
that formula (l23l) is rather general and can be always applied to data described 
by a Weibull distribution with power-law tail. 

5.3 On the sign of the second derivative of e to confirm that e takes its 
minimum at t x 

In order to confirm that e takes its minimum (not its maximum) at t x , we 
can evaluate the sign of the second derivative of e with respect to t x , that is, 
d 2 e/dt 2 x . One can label the cases in equation (IT91 as follows: 




(23) 



t* x ~ 23538.3 [s]. 



(24) 
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dt 
dt x 



—P\y(t x ) + Ppower(tx) (Pr > P\v) A (Pr > Ppower) (A) 

2-Pt(^x) — Pw\tx) + Ppower(tx) (Pt > Pw) A (Pr < Ppower) (B) 

— 2Pp(tx) + Pvk(^x) — Ppower(t x ) (Pt < Pw) A (Pt > Ppower) (C) 

^ Pvk(^x) — Ppoweritx) (Pr < Pw) A (Pp < Ppowe 



It should be kept in mind that the empirical data should fall in one of the 
above four categories: case A, case B, case C, case D. In addition, we should 
notice that each conjuncted condition in case A is opposite to case D, and 
each conjuncted condition in case B is opposite to case C with respect to the 
sign of the second derivative of e at t x . Therefore, we should check whether 
the second derivative of e takes positive value or not in each case. 

For any case, we need the derivatives dPw/dt x , dP power /dt x and dPT/dt x . 
The last one dPx/dtx denotes a derivative of the empirical distribution at t x 
and we should evaluate of the derivative numerically from the emprical data. 
However, the empirical data analysis suggests (Pt > Pw) A (Pr > Ppower) 
is true (case A) (see Figure [2]). Actually, we do not need the evaluation of 
dPr/dtx and we need only the above first two derivatives. We obtain them 
analytically 



dP 



dt, 



dP 



power 



dP 



w 



dt 



dP, 



power 



dts 



dt 



t=t> 



t=u 



m 
—t 



m—2 
X ' 



m 



im 2 hL 
t v e a < U. 



(25) 
(26) 



We should notice that the sign of dP power /dt x is negative for any choice of 
the parameters 7, m, a and t x . However, the sign of the dP\y/dt x depends on 
the parameters. For instance, for t x = 0, dP\y/dt x > for m > 1, whereas, 
dPw/dt x < for m < 1. 



Then, we evaluate the differences: 



dPpower dPy/ 771 m _ 2 -t*. n /, \ 

—f, = — *x e a D {tx) (27) 

dt x dt x a 

where we defined 



7T? 

P(t x ) = -t™- 7 -m + l. (28) 
a 

In Figure [lOj we plot the D as a function of t x for parameter values m = 
0.585, a = 49.63 and 7 = 4.67 as in the case of the Sony Bank data. From 
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Fig. 10. The behavior of the function D as a function of t x . We set 
m = 0.585, a = 49.63 and 7 = 4.67 as the Sony Bank data. 

this figure, we find that at some critical point t* ~ 23538.3 [s], the sign of the 
function D changes. By taking into account the fact that the crossover point 
used here is C = t x = 24000 > t* , we conclude that 



dPpower ^ dPw (29) 



dt x dt x 



Therefore, considering that the empirical data analysis suggests (Pt > Pw) A 
(Pt > Ppower) is true (case A), we prove that e takes its minimum at t x . 
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In order to discuss t x in more detail, we start from equation (1221) and take the 
derivative of e with respect to t x 





By taking de/dt x = 0, one obtains (1231) as the solution t x and the value for 
the empirical data is given by (1241) . To confirm that the solution t x gives the 
maximum of the e, we check the sign of the second derivative of e. We find 



d 2 e m 



e a t 



.m+7— 3 



dt 2 x a 

'm\ 2 ~ mt 



,2m _ "^x 

a 



ioo 
J t^'dt. (31) 
n 



Therefore, by replacing (123]) . namely, t x = t* into the above expression, we 
get 



d 2 e 



dtl 



m(m + 7 — l) 2 e" 



m + 7— 1 



t x =tl 



— (m + 7-I) 



m 



7-3 OO 



rut 



(32) 



and we conclude that the e takes its maximum at t x = t* x , that is, d 2 e/dt 2 x > 
for the solution of de/dt x =0. Therefore e has a minimum at t* . 



6 Application to BTP future data 



It is now interesting to see what happens when we apply the Weibull distribu- 
tion with a power-law tail to another financial data set and a different random 
variable. As mentioned several times, in the BTP future case, we study inter- 
trade durations and not first-passage times. In this section, we evaluate the 
average waiting time for the BTP future. Then, we investigate to what extent 
our formulation is applicable and we also discuss the limits of that formulation. 



6.1 Weibull-paper analysis for the BTP future 



To evaluate the average waiting time w for the Weibull distribution with a 
power law-tail, we estimate the parameters a,m and t x from the available 
empirical data. To this purpose, we carry out the so-called Weibull-paper 
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analysis. We show the result in Fig. [TTJ To produce this Weibull paper, we 
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Fig. 11. Weibull paper analysis for the data t < 200 [s]. 

used data up to t = 200 [s] (about 99.7% of the whole data set). From this 
figure, we find that there are apparent gaps between the empirical plot and the 
Weibull paper (straight line). Nevertheless, from the Weibull paper analysis, 
we obtain m = 0.85 and a = 10.02. For these parameters, the average waiting 
time estimated by the renewal-reward theorem leads to 



,r(- 

w = a™ — -r 



30.0 [si 



(33) 



whereas, by sampling from the empirical data, we obtain (t) ~ 16.5511 [s] and 



w ■ 



<0 
2(t) 



530.1 [si = 8.8 [mini 



(34) 



since from the empirical data one finds w > (t), the inspection paradox oc- 
curs for the BTP future data. Moreover, the empirical result is far from the 
theoretical prediction f[33l . The reason for the large gap might come from the 
bad fit of the empirical data by means of a pure Weibull distribution. 

We next reduce the range to fit the data by Weibull paper analysis from 
t = 200 [s] to t = 50 [s] which is about 96.1 % of whole data points. In Fig. 
IT2l we display the Weibull paper and obtain the parameters as m = 0.99 and 
a = 16.49. By making use of these parameters, the theoretical prediction of 
the average waiting time leads to w ~ 16.70 [s]. This value is very close to the 
value of the first moment for the empirical data (t). This result tells us that 
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the Weibull paper analysis for the data point up to t = 50 gives almost the 
same prediction as an exponential distribution for the duration of the BTP 
future. 
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Fig. 12. Weibull paper analysis for the data t < 50 [s]. 



6.2 Weibull distribution with a power-law tail for the BTP futures 



We now evaluate the optimal crossover point i* from the parameters obtained 
by empirical data analysis. Inserting these values m = 0.70, a = 6.05 and 
7 = 1 + (3 — 1.96 for t — 200 data points into our formula ( 1231) . we have 



t* x ~44.9[s]. (35) 

On the other hand, when we use the t = 50 data points, we use the values 
m = 0.99, a = 16.49 and 7 = 1.96 into the expression (123]) and obtain 



t* x ~33.5[s]. (36) 

We next evaluate the average waiting time w by using the formula (ji~5l) which 
was corrected by means of the power-law tail effect. In Fig. H3J, we plot the 
corrected average waiting time w for the BTP future as a function of t x for 
both cases of t, namely, for t = 200 (m = 0.70, a = 6.05) and 50 (m = 0.99, a = 
16.49). From this figure, we see that, at the predicted optimal crossover point 
t* , both curves take their maximum, however, the values of the average waiting 
time are lower than (to make matters worse, for large t x it becomes negative) 
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Fig. 13. The corrected average waiting time w for the BTP future as a function of 
t x . We set 7 = 1.96. For two cases of the choice for the number of data points t, 
namely, for t = 200 (m = 0.70, a = 6.05) and 50 (m = 0.99, a = 16.49), the w are 
plotted. 

those estimated by a pure Weibull distribution. This is because the second 
terms appearing in both numerator and denominator in the formula becomes 
negative for the parameter range of 7 < 2. Thus, we conclude that the BTP 
future has too heavy a tail (7 = 1.96) to correct the average waiting time 
by using the formula (|T5|) . This is a limitation of our formula for the average 
waiting time for financial data. 



7 Summary and discussion 

In this paper, we have compared a Weibull distribution and a Mittag-Leffler 
distribution. Then, two relevant statistics, namely, the average waiting time 
and the Gini index have been studied in both cases. Our theoretical analysis 
revealed that the average waiting time diverges linearly as a function of the 
cut-off parameter t max for the Mittag-Leffler distribution. This fact implies a 
more difficult treatment to check the validity of modeling the market renewal 
process by means of the Mittag-Leffler distribution. On the other side, the 
Gini index for the Mittag-Leffler survival function is free from this kind of 
divergence because the tail part of the duration distribution does not con- 
tribute to the value so much. We also find that a Weibull distribution with 
a power-law tail is an efficient way to describe renewal processes in markets 
with a long duration such as the Sony Bank USD/JPY exchange rate seen 
as a first-passage process. We conclude that the Weibull distribution with a 
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power-law tail is more suitable to evaluate the relevant statistics for financial 
markets with a long duration. By considering the intuitive explanation of the 
non-monotonic behavior of the corrected average waiting time as a function 
of the crossover point, we obtained a useful formula to decide the appropriate 
(and might be an optimal) crossover point t* . In fact, we could reduce the gap 
of the average waiting time Aw between the theoretical and empirical data 
analysis from Aw = 4.57 [min] (for a pure Weibull distribution) to Aw = 2.94 
[min] by evaluating the average waiting time with the optimal crossover point 
t* and the parameter set (m, a, 7) obtained by the empirical data analysis of 
the Sony Bank rate. To investigate the limitation of our distribution to de- 
scribe the other financial data, we applied our distribution to the BTP future. 
We found from the Weibull paper analysis that for the short range duration 
regime, there exist apparently gaps between the empirical and our proposed 
distributions. To make matters worse, we concluded that the BTP future has 
too heavy tail to obtain the correction for the average waiting time by means 
of our formula ffl5l) . From these observations, we could say that our proposed 
distribution, namely, the Weibull distribution with a power-law tail is appli- 
cable to the financial data having the following two properties. 

• In short duration regime, it follows a Weibull-law. 

• It does not have too heavy tail, namely, 7 > 3 should be needed. 

If the above two conditions hold in the financial data, the duration of the data 
might be well described by our proposed distribution. 

We hope that our proposed method will be widely used as a powerful candidate 
to describe the duration in financial data having the above two properties. 
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